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SELF -ASSEMBLING BIOMOLECULAR STRUCTURES 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

5 This invention relates to self -assembling 

biomolecular structures, more particularly to such 
structures formed using modular units capable of 
biospecif ic interaction. 

2 . Description of the Related Art 

10 The assembly of ordered macromolecular protein 

structures is well known in nature, for example in the 
construction of transcriptional complexes in eucaryotes 
and in mechanically stabilising structures such as 
collagen in bone. 

15 Peptides which contain alternating hydrophilic and 

hydrophobic amino acid residues and which are capable of 
spontaneous formation of macromolecular/macroscopic 
membranes are also known: see, for example, US-A- 
5670483. Such macroscopic materials, which may be held 

2 0 together by a combination of hydrophobic and ion-pair 
interactions, may be useful biomaterials for medical 
products such as slow release drug delivery vehicles, 
artificial sutures etc. It will be appreciated that the 
need to employ component molecules which are 

25 complementary and/or structurally compatible inevitably 
limits the application of such techniques. 

It has been described by Petka, W.A. et al in 
Science (1998) Vol. 281 pp 389-391 that hydrogels may be 
formed from a single class of multi-domain protein, here 

30 non-specific interactions are involved as unstructured 
aggregation occurs based on interhelical hydrophobic 
interactions . 

OBJECTS AND SUMMARY OF THE INVENTION 
The present invention is based on the finding that 

35 a wide variety of self -assembling biomolecular 

structures may be formed using self -organising modular 
units which are capable of biospecif ic interactions. 
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Particularly of interest are protein structures formed 
using self -organising modular protein units. The 
various modular units, hereinafter referred to as 
affinity modules, typically comprise appropriate 
5 combinations of protein molecules selected to convey the 
required capability for biospecific interaction. Thus, 
for example, where A and A* represent a first pair of 
proteins having specific affinity for each other (a 
first affinity pair) , and B and B* represent a second 
10 pair of proteins having specific affinity for each other 
(a second affinity pair) , affinity modules A-A and A*- 
B*-A* may be used to create self -assembling linear 
structures of the type 

15 ... (A-A) - (A*-B*-A*) - (A-A-) - (A*-B*-A*) . . . 



Such linear structures only require a single affinity 
pair. B* is acting in this Example as a linker moiety 
for the two A* proteins. Additional use of B-B affinity 
2 0 modules may permit the assembly of sheet -like structures 
of the type 

. . . (A-A) - (A*-B*-A*) - (A-A-) - (A*-B*-A*) . . . 
25 B B 

I I 

B B 

I I 

. . . (A-A) - (A*-B*-A*) - (A-A-) - (A*-B*-A*) . . . 



30 



35 



Three dimensional structures may similarly be 
assembled using an appropriate selection of affinity 
modules. Thus, for example, A-A, A*-B*-A* and b-B*-B 
modules may self -assemble to form sheets of the type 



. . . (A-A) - (A*-B*-A*) - (A-A-) - (A*-B*-A*) . . . 

B B 

I I 

40 B* B* 

I I 
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B B 
I I 

. . . (A-A) - (A*-B*-A*) - (A-A-) - (A*-B*-A*) . . . 

5 Two such sheets may be linked together by using B-B 
modules to bind the B* moieties in the B-B*~B linking 
groups of adjacent sheets, so as to form a two- sheet 
structure . 

It will be appreciated from the foregoing that by 
10 using two or more affinity pairs of proteins (which term 
as used herein with regard to the invention also 
embraces appropriate peptides, polypeptides, protein 
domains and fragments etc.) which have specific and 
exclusive affinity for each other, by choosing 
15 appropriately constructed affinity modules in 

appropriate relative amounts, and where necessary by 
interacting the affinity modules in an appropriate 
order, it is possible to form self -assembled protein 
structures in a highly specific, predictable and 

2 0 controllable manner and with widely ranging degrees of 

complexity. 

Thus according to one aspect of the invention there 
is provided a self -assembled protein structure 
comprising at least two pairs of proteins (affinity 
25 domains) wherein the proteins of each of said pairs have 
specific and exclusive affinity for each other, said 
proteins being linked to form affinity modules capable 
of mutual biospecific interaction to form an assembled 
structure . * 

3 0 More generally, and having applications which are 

discussed in more detail below, the self -assembled 
structure may comprise non-proteinaceous components and 
thus in a further aspect, the present invention provides 
a self -assembled biomolecular structure comprising 
35 affinity modules, said affinity modules each having at 
least two affinity domains which may be the same or 
different, at least one affinity domain within each 
affinity module having specific and exclusive affinity 
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for an affinity domain within another affinity module, 
said affinity modules capable of biospecific interaction 
to form an assembled structure. 

Preferably, the self -assembled biomolecular 
5 structure will comprise at least two pairs of affinity 
domains, each of said pairs having specific and 
.exclusive affinity for each other. In one preferred 
embodiment, each affinity domain within a given affinity 
module will have specific and exclusive affinity for 

10 another affinity domain. 

Preferably, the self -assembled structure will 
comprise at least two different types of affinity 
modules. Preferably at least one of the affinity 
modules will have two or more different affinity 

15 domains. In a particularly preferred embodiment, the 

structure comprises at least one affinity module which 
has two or more functionally equivalent affinity domains 
and at least one affinity module which has two or more 
different affinity domains. 

20 Preferably the affinity modules and affinity 

domains will be proteinaceous but the self -assembled 
structure may incorporate non-proteinaceous molecules 
capable of specific interactions with other molecules 
(typically proteins) such as polynucleotides, 

25 polysaccharides, haptens, vitamins, lipids etc. Thus, 
one or more of the affinity domains or affinity modules 
may comprise such a non-proteinaceous molecule. 
Affinity domains will be considered 1 proteinaceous 1 even 
though they may contain a non-protein part such as a co- 

30 factor. In one preferred embodiment, affinity modules 
which comprise affinity domains allowing mutual 
recognition will also comprise domains capable of 
binding to a non-proteinaceous ligand such as a 
polynucleotide, polysuccaride etc. as mentioned above. 

3 5 Affinity modules can be designed in such a way that the 
full self -assembling structure is only formed in the 
presence of this non-proteinaceous ligand, thus 
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providing a useful diagnostic tool where the self- 
assembled structure can be readily detected. 

Such a technique is also applicable for detecting 
proteinaceous ligands, affinity modules being designed 
5 with domains which are capable of specific interaction 
with the proteinaceous ligand of interest which is then 
incorporated into the self -assembled structure. 
Alternatively a ligand, which need not be proteinaceous, 
could be encapsulated by interaction with affinity 

10 modules for therapeutic use, e.g. drug delivery, 
possibly in a controlled release system. 

The affinity domains are identifiable molecular 
units having the binding affinities discussed herein. 
The term 'domain' is used functionally to refer to a 

15 unit having a certain binding affinity and does not 

imply a particular structural arrangement. The affinity 
domains may be naturally occurring molecules or 
fragments thereof or such molecules which have been 
modified or molecules designed de novo. Preferably, at 

20 least one of the affinity domains is an 'affibody', a 
protein domain which has been engineered to have a new 
or enhanced binding affinity for a target molecule. 
Engineering may be by combinatorial engineering or 
possibly by rational design. 

25 The present invention relates only to non-naturally 

occurring self -assembled structures. While one or more 
of the affinity domains or affinity modules may be 
naturally occurring (although conveniently prepared by 
recombinant techniques) the overall structures are not 

3 0 the same as those which are formed in nature. 

Preferably, one or more of the affinity domains will 
have been selected from a molecular library. As 
described in more detail in the present Examples, a 
domain which has good affinity for a molecule which is 

35 to form another affinity domain within the self- 

assembled structure can be selected from a displayed 
molecular library. Molecular libraries can be prepared 
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by combinatorial engineering of a given protein or 
protein domain and candidate affinity domains selected.. 

Alternatively, libraries can also be constructed by 
harvesting collections of genes from natural sources, 
5 which have already in a sense been randomised 

(Hoogenboom et al . , Immunotechnology 4, pp. 1-20 
[1998] ) . Convenient affinity domains include bacterial 
receptors, domains derivatives and fragments thereof. 
Each affinity domain thus has its own specific 

10 binding partner, i.e. an affinity domain with which it 
enjoys exclusive and specific interaction, forming an 
affinity pair. In a given self -assembled biomolecular 
structure, affinity modules may comprise affinity 
domains which are not able to take part in the specific 

15 interactions because a suitable binding partner is not 
present. For example in Examples 4 and 5 ABD is not 
able to undergo specific interactions with an affinity 
partner and the structure depends only on interactions 
between the Z 8G domains and SPA. A further type of 

20 affinity module could be provided which incorporated, 
for example, an albumin affinity domain and the 
structure would then comprise 2 affinity pairs, 2 pairs 
of specifically interacting affinity domains. 

Reference is made herein to "specific and exclusive 

25 affinity" between affinity domains. The terms does not 
imply that a given affinity domain could only ever 
interact with a single molecular species, rather, within 
the system each affinity domain has a clearly 
identifiable affinity partner with which it interacts 

3 0 much more strongly than any other domain present. The 
affinity domains Z 8G and Z 6S described in the Examples 
both have 'specific and exclusive affinity' for SPA 
within the meaning of the present invention, i.e. each 
is capable of forming an affinity pair with a domain of 

35 SPA. 

The interactions may be based on contributions from 
a number of different types of interactions including 
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hydrogen bonds, these interactions typically being 
strongly sequence and conformation dependent. The 
affinity between domains is specific, rather than 
relying on general interactions based on broad 
5 functional characteristics such as hydrophobicity which 
could be displayed by a large group of molecules. 
Typical of the specific and exclusive affinity observed 
between affinity domains in the present invention are 
protein: protein, antigen : antibody type interactions. 

10 The aforesaid affinity, modules, which themselves 

constitute a feature of the invention, may for example 
comprise naturally small and highly soluble protein 
affinity domains, such as individual domains of the 
bacterial surface receptors of staphylococcal protein A 

15 (SPA) or streptococcal protein G (SPG) (Stahl, S. and 
Nygren, P-A. , Patologie Biologie 45, pp. 66-76 [1997]; 
Jansson, B. et al . , FEMS Microbiology and Immunology 20, 
pp. 69-78 [1998]). Domains having affinity therefor may 
be found in collections or libraries of individual 

20 domains of SPA (Nord, K. et al . , Nature Biotechnology 

15, pp. 772-777 [1997]), SPG, antibody fragments such as 
single-chain Fv, Fv or Fab-fragments (Hoogenboom et al . , 
I mmuno techno logy 4, pp. 1-20 [1998]), other protein 
domains (Nygren, P. -A. and Uhlen, M . , Curr. Opin. 

25 Struct. Biol. 7, pp. 463-469 [1997]) or linear or cyclic 
peptides (Cortese, R. et al., Curr. Opin. Biotechnol . 6, 
pp. 73-80 [1995]), for example created using 
combinatorial protein engineering technology; affinity 
pairs may also be found from such collections or 

30 libraries. Identification of variants having the 

desired binding capabilities may be performed using 
standard in vitro selection techniques such as phage 
display technology (Clackson, T. and Wells, J., Trends 
Biotechnol. 12, pp. 173-184 [1994]; Smith, G.P. and 

35 Petrenko, V.A. , Chem. Rev. 97, pp. 391-410 [1997]), 

ribosomal display (Hanes, J. and Pluckthun, A., Proc . 
Natl. Acad. Sci . U.S.A. 94, pp. 4937-4942 [1997]), 
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peptides on plasmids (Schatz, P.J., Bio/technol. 11, pp. 
1138-1143 [1993]) or mRNA-protein coupling using 
pyromycin (Roberts, R.W. and Szostak, J.W. , Proc . Natl. 
Acad. Sci. U.S.A. 94, pp. 12297-12302 [1997]). 
5 Proteins which form all or part of an affinity 

module or domain may have been made by randomisation 
(random mutagenesis) of a particular protein to generate 
ligands with novel, modified or enhanced binding 
characteristics. These proteins are referred to as 

10 combinatorial proteins. Such a technique typically 
involves random mutagenesis of a target protein, 
expression of the full library of these variants, e.g. 
on the surface of filamentous bacteriophage, followed by 
selection of a protein exhibiting the desired binding 

15 characteristics, this selection typically involving a 
binding reaction between the variant protein and an 
immobilised ligand (binding partner) i.e. target 
molecule for the protein, e.g. target analyte. The 
mutagenesis is random in that the resulting amino acid 

2 0 encoded by any particular codon is not generally pre- 
determined but the positions where mutations are to be 
introduced are generally identified in advance. The 
mutagenesis may involve amino acid substitution, 
deletion, or addition (e.g. insertion). Mutagenesis of 

25 the Z domain derived from domain B of SPA is shown in 
Fig. 2. 

The use of an expression system such as surface 
display on phage provides a crucial link between 
genotype and phenotype; there is a self-contained unit 

30 which can be selected on the basis of its specific 

binding interactions and which also carries the nucleic 
acid encoding for the protein responsible for the 
observed binding characteristics. This enables 
identification and subsequent expression in useful 

35 amounts of the protein selected for its binding 

characteristics, such expression typically taking place 
in a transformed bacterial host. . 
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The term 'surface display' refers to the technique 
involved in selection of a protein from a library of 
molecules which are presented (displayed) in a manner 
which enables differentiation between the protein 
5 molecules on the basis of their binding characteristics. 
Surface display is typically performed on the surface 
of filamentous bacteriophage (phage display) but display 
can also be on the surface of bacteria, yeast cells 
ribosomes or using viral systems. Any "surface display" 

10 technique known or proposed in the art, may be used 

according to the present invention in the selection of 
affinity domains. 

Techniques for construction of a combinatorial 
library of protein molecules and subsequent selection to 

15 obtain proteinaceous ligands having desired binding 
characteristics are known in the art (Nygren, P. and 
Uhlen, supra (1997) 7: 463-469) . 

Generally, a protein molecule, perhaps having 
intrinsic beneficial properties such as temperature or 

20 pH insensitivity, is used as a scaffold and a 

combinatorial library is then constructed via random but 
targeted amino acid substitutions (or other mutations) 
of that protein molecule, in order to produce a library 
of molecules having different binding characteristics. 

2 5 Surface residues are generally targeted for random 
mutagenesis . 

Suitable protein scaffolds may simply be linear 
peptides but preferably the scaffold will possess a 
folded three dimensional structure which has the 

30 potential for higher affinities and is less susceptible 
to proteolytic . degradation . Rather than designing a 
scaffold de novo, naturally existing proteins or domains 
are usually selected for further engineering. The 
choice of protein scaffold depends on several parameters 

35 including an ability to be effectively expressed in a 
desired host organism e.g E. coli when the randomised 
protein is to be displayed as a fusion protein with a 
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filamentous phage coat protein. The protein should also 
comprise sufficiently large regions on its surface which 
are tolerant to substitution (or insertion or deletion 
etc.) without losing the overall three dimensional 
5 structure. If the library is to be produced 

synthetically, a small overall size is a prerequisite. 
Where the selected scaffold protein has a binding 
function, amino acid residues involved in that 
interaction may be a target for randomisation. 

10 Randomisation may be performed in order to enhance known 
binding properties or to develop ligands with new 
specificities . 

Suitable scaffold molecules are discussed in Nygren 
et al. (1997, supra) and include cyclic peptides having 

15 4 0 or more residues in a constrained sequence, 

immunoglobulin- like scaffolds including Fv or single- 
chain (scFv) domains, bacterial receptors such as the 
58-residue one-domain Staphylococcal protein A (SPA) 
analogue Z (the "Z Domain 11 being a derivative of the B 

2 0 domain of SPA) , or other domains or analogues of SPA, 
DNA-binding proteins particularly zinc fingers and 
protease inhibitors. All these molecules can be 
stabilised by substituting one or more native asparagine 
residues with a less alkaline sensitive amino acid 

25 before a combinatorial library of the pre-stabilised 
protein is made. 

Of particular interest is the bacterial receptor 
domain Z (Nord, K. , Nilsson, J., Nilsson, B., Uhlen, M. 
and Nygren, P. Protein Engineering (1995) 8., 6, 601- 

30 608) . This paper by Nord et al . describes a suitable 
method of constructing a combinatorial library of 
protein molecules which can be applied to a range of 
scaffold molecules. The method described is solid- 
phase-assisted and based on the stepwise assembly of 

35 randomised single-stranded oligonucleotides. 

Selection from a generated protein library can be 
performed in a number of different ways known in the 
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art, including bead immobilised libraries (McBride, 
J.D., Freeman, N. , Domingo, G.J. and Leatherbarrow, 
R.J., J. Mol. Biol. [1996] 259: 819-827), fusions to 
DNA-binding proteins (Schatz, P.J., Biotechnology [1993] 
5 11, 1138-1143) and when displayed on bacteria (Lu, Z., 
Murray, K.S., Van Cleave, V., LaVallie, E.R., Stahl, 
M.L. and McCoy, J.M. Biotechnology [1995], 13. 366-372) 
or phage (Clackson, T. and Wells, J. TIBTECH (1994) 12, 
173-183) as well as yeast cells, (Boder, E.T. and 

10 Wittrup, K.D., Nature Biotechnology (1997) 15, 553-557) 
and in viral systems (Ernst, W. , Grabher, R., Wegner, 
D., Borth, N., Graussauer, A. and Katinger, H. Nucleic 
Acids Research (1998) 26, 1718-1723 and Grabher, R. , 
Ernst, W., Doblhof f -Dier , 0., Sara, M. and Katinger, H. 

15 BioTechniques (1997) 22, 730-735) . 

International Patent Application, publication No. 
WO 95/19374 describes the generation of a combinatorial 
library of Z-variants, see in particular Example 4. 

Thus in a further aspect, the present invention 

2 0 provides a method of preparing a combinatorial library 

of protein molecules, molecules selected from said 
library being for use in an affinity module as defined 
above. The invention also provides the use of a 
molecule selected from a molecular library as an 
25 affinity domain in a self -assembled biomolecular 
structure as described herein. 

In a yet further aspect of the present invention is 
provided a method of selecting a molecule for use in an 
affinity module (i.e. as an affinity domain) as defined 

3 0 above, wherein said molecule is selected from a 

molecular library on the basis of its ability to bind to 
a particular target molecule. The "target molecule" 
will typically comprise an affinity domain used in the 
self-assembly of a biomolecular structure, the selected 
35 molecule forming an affinity pair therewith. 

Albumin Binding Protein (ABD) , a protein domain 
with affinity towards human serum albumin (HSA) may be 
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used as an affinity domain or as a further, linker 
component of the affinity module which is attached to 
affinity domains. It is derived from a cell wall 
anchored bacterial receptor protein from Streptococcus 
5 G148. The wild type sequence of this protein has four 
asparagine residues, enhanced stability is observed when 
just one of these residues is substituted but preferably 
all four residues are replaced by less alkaline 
sensitive residues. The wild type amino acid sequence 
10 of ABD is: 

LAEAKVLANRELDKYGV- SDYYKNL INNAKTVEGVKAL I DE I LAAL P 

the asparagine residues have been indicated in bold. 
The hyphen simply indicates that other molecules in the 
15 same family have an additional amino acid in this 
position. 

ABD may be subjected to randomisation to create a 
protein (i.e. a combinatorial protein) having e.g. 
modified binding characteristics for HSA and/or that is 

20 able to bind any target of choice and also retains 
affinity towards HSA. 

The affinity modules may be synthesised by 
recombinant DNA technology, by transformation of a 
suitable host cell with a plasmid or viral construct . 

25 encoding for the affinity module protein, or by chemical 
synthesis . 

Chemical or genetic fusion may be employed as 
necessary to link affinity domains so as to form 
appropriate bi- or multi-valent or bi- or multi- 

30 functional affinity modules. Some modules such as the 5 
domain SPA will be expressed as a whole f module ! without 
the need to introduce regions into the nucleotide 
sequence which will provide linking regions or to modify 
the expressed protein to join the domains together. 

35 Alternatively, Example 3 describes how a genetic 

construct may be prepared to encode a fusion protein 
which can act as an affinity module. 
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Typically, the affinity between modules (i.e. 
between the domains thereof) should be in the range of 
K aff = 10 4 -10 11 M" 1 in order to allow stable self -assembled 
structures to form under a range of conditions, for 
5 example under physiological conditions or at different 
pHs and/or temperatures. For certain applications it 
may, however, be advantageous to prepare structures 
which may be reversibly disassembled as a result of, for 
example, pH or temperature change. 

10 The nature of self -assembled biomolecular 

structures formed in accordance with the invention may, 
for example, be investigated by techniques such as size 
exclusion chromatography, electron microscopy, viscosity 
analyses, microcalorimetry , nuclear magnetic resonance 

15 spectroscopy, atomic force microscopy and X-ray 
crystallography. 

In a further aspect, the present invention provides 
a proteinaceous affinity module which comprises two or 
more proteinaceous affinity domains which may be the 

2 0 same or different, each affinity domain having specific 

and exclusive affinity for a given binding partner, 
wherein at least one of the affinity domains has been 
selected from a molecular library. The use of such 

modules in the formation of a self assembled 
25 biomolecular structure constitutes a further aspect of 
the present invention. Nucleic acid molecules encoding 
such a protein as well as cells expressing the protein 
constitute further aspects of the present invention. 

As discussed above, 'molecular library' refers to a 

3 0 combinatorial protein library generated by mutagenesis 

of a starting molecule or the components of 'library 1 of 
molecules harvested from natural sources, a naive 
library as described by Hoogenboom et al . supra. 

Preferably, one or more of the affinity domains is 
35 capable of specific and exclusive interaction with a 

bacterial receptor domain such as SPA. Preferably one 
of the affinity domains is derived from a bacterial 
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receptor domain such as domain Z. The inventors have 
isolated particularly useful affinity domains which are 
derived from domain Z and are identified herein as Z on 

8G 

and Z 6s ; these protein molecules constitute further 
aspects of the present invention. The amino acid 
sequences of these affinity domains are given in Fig. 2. 

Also included within the scope of the present invention 
are variants of these molecules which differ only in the 
regions of the molecule which have not been randomized 
(i.e. not marked *) and wherein those modifications do 
not significantly affect the binding affinity of the 
domain as a whole for SPA. Modifications may include 
additions, substitution or deletion. Preferably, any 
modified domains will have at least 80%, more preferably 
at least 90% of the binding affinity of the Z 8G or Z 6s 
molecule for SPA. Preferred affinity modules comprise 
affinity domains linked to ABD. 

Nucleic acid molecules encoding the affinity 
domains and affinity modules described above, including 
vectors incorporating such nucleic acid and cells 
expressing such domains and modules constitute further 
aspects of the present invention. Methods of selecting 
affinity domains, for example by phage or ribosome 
display, for, a molecular library constitute a still 
further aspect of the present invention. 

Also, a method of producing a self -assembled 
biomolecular structure as defined herein, which 
comprises admixing affinity modules in an environment 
which enables interaction between the affinity domains 
thereof comprises a further aspect of the present 
invention. 

The structures have a wide range of therapeutic 
diagnostic and other applications. These structures 
which may be in the form of fibres, sheets or three 
dimensional structures may be useful as materials, e.g. 
for encapsulation of active agents or they may have a 
more active functional role, e.g. in bioelectronic 
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applications. A variety of applications are discussed 
below. 

Chemical groups and/or molecules (including further 
proteins) may if desired be incorporated into the 
5 affinity modules, for example by chemical coupling or 
genetic fusion, thereby permitting the preparation of 
assembled structures with a variety of properties and 
functionalities. Structures obtainable in accordance 
with the invention may thus have a wide range of 

10 applications, including, for example diagnostic 

applications, e.g. to obtain highly avid reagents; 
clinical applications, e.g. to obtain controlled 
delivery of therapeutics; nano- fabrication applications, 
e.g. to obtain spontaneous build up of ordered small - 

15 scale structures; biotechnological applications, e.g. to 
obtain thermally or chemically reversible protein 
networks; and provision of basis for stepwise enzymatic 
treatment of a substrate at a defined position. 

Thus, for example, in the formation of highly avid 

20 diagnostic reagents, effector molecules such as reporter 
enzymes or other detection molecules, including alkaline 
phosphatase, 0-galactosidase, horse radish peroxidase, 
fluourescein isothiocyanate, green fluorescent protein 
or derivatives thereof, lucif erase and/or additional 

25 affinity reagents such as biotin, streptavidin, 

cellulose binding domains, zinc fingers, antibody 
fragments or protein or peptide moieties may be 
chemically or genetically coupled to a suitable affinity 
module . 

30 In clinical applications enzymes or therapeutics 

(including vaccines) may be encapsulated in self- 
assembling biomolecular structures by mixing the 
affinity modules with the substance to be encapsulated. 
Controlled release of the active substance may occur 

35 upon, for example, a change in buffer or temperature 

conditions, or in vivo upon spontaneous disassembling of 
the structures. An advantage of the in vivo use of 
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self -assembling biomolecular structures is that these 
are normally biodegradable. 

Self -assembling biomolecular systems in accordance 
with the invention may be used to provide controlled 
5 "plugging" of cavities or holes, either ex vivo or in 
vivo, for example by controlled release of enzymes or 
other active substances. Thus, for example, protein 
self-assembly systems capable of preventing bleeding may 
be prepared by derivatisation of affinity modules with 

10 substances directing the assembly to specific sites 
around injuries. 

Self -assembled protein structures according to the 
invention may be used in the preparation of hydrogels 
capable of reversible gelation under selected buffer or 

15 temperature conditions, and in the preparation of 
protein fibres or films which may be used to form 
biodegradable materials such as threads or cloths. 

Following address-specific deposition of a suitable 
anchoring affinity module onto a bio-chip, higher forms 

2 0 of structures may be produced in a stepwise manner using 
further affinity modules in accordance with the 
invention. Such self-assembly may be used to create 
structures for use in the detection of or functional 
studies of biological samples. In addition, through 

25 derivatisation of affinity modules with suitable' 

substances such as enzymes or organic substances, a 
build up of complexes mimicking natural systems, for 
example substrate conversion by concerted action of 
several enzymes, or production of energy or light as in 

30 photosynthesis or photorespiration, may be accomplished. 
In bioelectronic applications the spontaneous 
assembly of affinity modules into ordered structures 
may, for example, serve as a means to create small 
circuits useful for conducting electric charges or 

35 currents. Suitably derivatised affinity modules, e.g. 
highly charged modules or modules into which molecular 
groups with high electron densities have been directly 
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or indirectly incorporated, may be used to prepare self- 
assembled structures useful as "wires" for nano- 
fabrication of electric circuit components such as 
resistors, transistors or capacitors.. Self -assembled 
5 structures may also be used for site-specific deposition 
of metallic materials such as silver or gold, for 
example in order to create anchoring points from which a 
circuit may be built up by attachment of appropriate 
affinity modules, for example using affinity modules 
10 containing cysteine residues to couple to deposited 
gold. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The invention will now be further described by the 
following non-limiting examples in which: 

15 Figure 1 shows biosensor analysis of selected 

staphylococcal protein A-specific affibodies. In the 
analysis, affibodies were analyzed as fusion proteins to 
the serum albumin binding domain ABD. Upper panel: (A) 
Response curve obtained for injection of the 8G affibody 

2 0 over a protein A-coated surface; (B) Response curve 
obtained for injection of a control affibody (non- 
protein A binding) over a blank surface; (C) Response 
curve obtained for injection of a control affibody (non- 
protein A binding) over a protein A-coated surface. 

25 Lower panel: (A) Response curve obtained for injection 
of the 6S affibody over a protein A-coated surface; (B) 
Response curve obtained for injection of the 6S affibody 
over a blank surface; (C) Response curve obtained for 
injection of a control affibody (non-protein A binding) 

30 over a protein A-coated surface. 

Figure 2 provides the sequences of selected anti- 
staphylococcal protein A affibody affinity modules 8G 
(SEQ ID No. 2) and 6S (SEQ ID No. 3) . Amino acid 
sequences are shown in one- letter code. Also shown is 

35 the amino acid sequence of the wild type Z domain (SEQ 
ID No. 1) used as scaffold for library constructions. 
In addition, boxes corresponding to the three alpha- 
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helical regions of the domain are shown. Randomized 
positions are shown as asteriscs. 

Figure 3 shows SDS-PAGE analysis of proteins 
purified by protein AG or HSA-af f inity- chromatography 
from periplasmic preparations of a culture of E. coli 
cells harboring the 8G clone phagemid encoding the Z 8G - 
ABD fusion protein. Lane 1: HSA-af f inity purified 
material; Lane 2: HSA-af f inity purified material (1:10 
dilution) ; Lane 3 : Protein AG-affinity purified 
material; Lane 4: Protein AG-affinity purified material 
(1:10 dilution). In both cases, the Z 8G -ABD protein is 
purified to near homogeneity, showing upon selective 
interactions between the Z 8G moiety and protein A and the 
ABD moiety and HSA, respectively. Molecular weight 
markers are: 14, 20, 30, 43, 66, 92 kDa. 

Figure 4 shows construction and production of the 
divalent staphylococcal protein A-binding hybrid 
affinity module (Z 8G -ABD-Z 8G ) . The figure shows the 
cloning chart for the joining of two different PCR 
products by splice overlap extension (SOE) PCR to result 
in the gene encoding the Z 8G -ABD-Z 8G protein . 

Figure 5 shows SDS-PAGE analysis of purified Z 8G - 
ABD- Z 8G protein. The figure shows the results from an 
SDS-PAGE analysis of HSA-affinity chromatography 
purified protein (lane 1) or protein AG-purified protein 
(lane 2) from a periplasmic extract of E. coli cells 
harboring the pE-Z 8G -ABD-Z 8G construct . Molecular weight 
markers are: 14, 20, 30, 43, 66, 92 kDa. 

Figure 6 shows native PAGE analysis of complex 
formation between Z 8G -ABD-Z 8G protein and staphylococcal 
protein A. Lane 1: Z 8G -ABD-Z 8G protein; Lane 2: protein A 
protein; Lane 3: An aliquote from a mixed sample 
containing 10 fiM of Z 8G -ABD-Z 8G and 2.5 fiM of protein A; 
Lane 4: An aliquote from a mixed sample containing 10 /jlM 
of Z 8G -ABD-Z 8G and 5 fiM of protein A; Lane 5: An aliquote 
from a mixed sample containing 10 /iM of Z 8G -ABD-Z 8G and 
7.5 /xM of protein A; Lane 6: An aliquote from a mixed 
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sample containing 10 /zM of Z 8G -ABD-Z 8G and 10 /iM of 
protein A. 

Figure 7 provides atomic force microscopy images 
from analyses of complex formation between Z 8G -ABD-Z 8G 
5 protein and staphylococcal protein A. Left panel: 

Analysis of a dried 20 nM sample of protein A-cys by 
AFM . Right panel: Analysis of a dried 10 nM sample of 
protein A by AFM, in which the divalent protein A- 
binding affinity module Z 8G -ABD-Z 8G had previously been 
10 added to a final concentration of 2 0 nM. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

EXAMPLES 



Standard procedures: 

15 Phagework including preparation of phage stocks and 

titration was performed as described above (Nord et al . 
(1997)). The phage libraries Zlib-1 and Zlib-2 have 
also been described above (Nord et al . , 1997). Standard 
cloning work was performed as described in (Sambrook, 

20 J., Fritsch, E. F. and Maniatis, T. Molecular cloning: a 
laboratory manual, 2nd edn. , Cold Spring Harbor 
Laboratory, New York, 1989) . Native gel protein 
electrophoresis was performed in a Novex Xcell II (San 
Diego, CA) as described by the supplier. PCR 

25 amplifications were performed using standard conditions 
using oligonucleotide primers from Table 1 as specified 
in the examples. 



Table 1. List of oligonucleotide primers. 

3 0 

Name Sequence 5 "-3' 



SNAP-1 GCG 
SNAP -2 AGG 
35 SNAP -3 GAT 
GTA 

SNAP -4 GCG 



AAT TCG GTA 
TAA TGC AGC 
GAA ATT TTA 
GAC AAC AAA 
CGC TGC AGT 



GAC AAC AAA TTC 
TAA AAT TTC 
GCT GCA TTA CCT 
TTC AAC AAA G 
TAG TCG ACT TTC 



AAC AAA G 
AGC GGC AGC 
GGC GCC TGA 
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GCA TCA T 

RIT2 7 GCT TCC GGC TCG TAT GTT GTG TG 

RIT28 AAA GGG GGA TGT GCT GCA AGG CG 

USP CGT TGT AAA ACG ACG GCC AG 

5 RSP TTC ACA CAG GAA ACA GCT ATG ACC 

NOKA-2 CGG AAC CAG AGC CAC CAC CGG 



E. coli cells used for phage work, cloning and 
expression were either RR1AM15 (Ruther, U. Nucl . Acids 

10 Res. 10: 5765-5772, 1982). Osmotic shock procedures 
were performed as described by (Nygren et al.,1988. 
Affinity chromatography purifications of proteins on IgG 
or HSA-Sepharose resins were also performed as described 
above (Nygren et al . , J. Mol . Recognit. 1:69-74, 1988). 

15 Human polyclonal IgG was supplied by Pharmacia and 
Upjohn AB, Stockholm. 

DNA sequencing was performed by cycle sequencing 
(Carothers et al . , BioTechniques 7:494-499, 1989) using 
ThermoSequenase DNA polymerase (Amersham Pharmacia 

2 0 Biotech) and primers as indicated. Sequencing reactions 

were loaded onto a MegaBace 1000 instrument (Amersham 
Pharmacia Biotech, Uppsala, Sweden) and the results were 
analyzed using Sequencher 3.1.1 software. 

Atomic force microscopy (AFM) was performed using a 
25 Nanoscope Ilia (Multimode) from Digital Instruments. 
Protein containing solutions were dried onto newly 
prepared mica surfaces and analyzed without further 
treatment . 

Biosensor experiments were performed using a 
30 Biacore 2000 instrument (Biacore AB, Uppsaal, Sweden) 

according to the suppliers recommendations. Sensorgrams 
were analyzed using Biaevaluation 2.1 software. CM-5 
sensor chips were used for coupling of proteins via 
amine cooling chemistry according to the suppliers 

3 5 recommendations .. 



Example 1. Phage display selection of a staphylococcal 
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protein A- specific binding protein. 

In order to isolate binding proteins capable of 
selective binding to staphylococcal protein A, phage 
display in vitro selection experiments were performed 
5 using protein libraries constructed by combinatorial 
engineering of a protein domain (Z) , derived from 
staphylococcal protein A (Nord et al . 1997) . The 
libraries Zlib-1 and Zlib-2 have been described 
previously (Nord et al . , 1997) and were constructed by 

10 randomization of thirteen surface located positions of 
the three-helix bundle domain Z. For the selection of 
binding proteins to the protein A, phage stocks from the 
libraries were prepared as described by Nord and 
coworkers (Nord et al . , 1997). As targets during 

15 selections, recombinant protein A was used (Amersham 
Pharmacia Biotech, Uppsala, Sweden) . 

Approximately 570 /xg of the purified protein A 
protein was subjected to in vitro biotinylation using a 
commercial biotinylation kit (Pierce, art. No 21335) 

20 based on amine coupling chemistry. For selections, 

approximately 40 fig of the biotinylated protein A was 
immobilized onto streptavidin coated paramagnetic beads 
(Dynal M2 80-SA, Dynal AS, Norway) according to the 
suppliers recommendations. This resulted in an 

25 immobilization of approximately 5 fig protein A per mg of 
beads. In each round 5 mg of beads were used. 

Selections were performed for five cycles using 
conditions for phage stock preparation as described 
earlier (Nord et al . 1997), with the addition of a 

3 0 preincubation step of phagestocks with streptavidin- 

coated beads (containing no biotinylated proteins) as to 
prevent the selection of streptavidin binding Z 
variants. In each round of selection, approximately 10 11 
cfu of the phagestocks (a 50/50 mixture of Zlib-1 and 

3 5 Zlib 2 -phage libraries) were used as input to 

approximately 5 mg streptavidin coated beads in final a 
volume of approximately 100 /zl . This resulted in a 
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final target concentration of approximately 7 /iM protein 
A. 

In the selection rounds, either low pH (0.1 M 
Glycin-HCl, pH 2.2) (S- indexed binders) or competition 
5 with a 100 -fold molar excess of human polyclonal IgG 
(300 fiM) (G-indexed binders) was used for elution. 
After five cycles of selections, aliquots of the 
respective eluates were used to infect RR1AM15 cells 
which were spread on selective media (ampicillin agar 
10 plates, 100 /zg/ml) . 

Example 2 . Characterization of a protein A-binding 
affibody protein. 

To investigate if resulting clones from the 

15 selection experiment corresponded to affibody variants 
capable of selective protein A binding, eight randomly 
picked clones were studied further. Cells harboring the 
different phagemid construct were cultivated in 100 ml 
shakeflasks in a tryptone soy broth growth medium (TSB, 

20 Difco) containing 100 //g/ml of ampicillin for production 
of selected affibody proteins as fusions to a serum 
albumin binding domain from streptococcal protein G, as 
encoded from their respective phagemid vectors. 
Proteins were purified by HSA-affinity chromatography 

25 (Nygren et al . , 1988) from samples prepared by osmotic 
shock of cells to release periplasmic material. 

Purified affibody-ABD fusion proteins were 
subsequently analyzed for protein A-binding using 
biospecific interaction analysis (BIA) in a biosensor 

30 instrument (Biacore 2000, Biacore AB, Uppsala, Sweden) . 
To a flow cell surface of a CM-5 sensor chip 
approximately 1200 RU protein A was coupled using amine 
coupling chemistry (NHS/EDC) according to the suppliers 
recommendations. Injections of samples of the different 

3 5 affibody-ABD fusion proteins resulted in the 

identification of two clones showing specific protein A- 
specific binding (Figure 1) . These two clones, 8G and 
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6S, had been selected using competitive human IgG (8G) 
and low pH-elution (6S) , respectively. The affibody 
genes in the phagemids of the two positive clones 8G and 
6S were subjected to DNA sequencing by cycle DNA 
5 sequencing using primers RIT27 and NOKA-2 for PCR 

amplification and primer NOKA-2 for sequencing reactions 
with dye-labeled dideoxy nucleotide terminators. The 
results from this sequencing is shown in figure 2, in 
which deduced amino acid sequences of the two affibody 
10 clones are shown. This result showed that affinity 
proteins showing specific binding to the highly 
repetitive staphylococcal protein A could be selected 
from the Z domain libraries. 

The selectivity in binding of the 8G clone was 
15 further demonstrated in the purification of the Z 8G -ABD 
fusion protein by protein AG-affinity chromatography 
(Jansson et al . , FEMS Immunol. Med. Microbiol. 20:69-78, 
1998) from an osmotic shock preparation of a culture of 
RR1AM15 cells harboring the corresponding phagemid 
20 vector. Analysis of low pH-eluted proteins by SDS-PAGE 
(Figure 3, lane 1) showed upon a single band 
corresponding to the Z 8G -ABD fusion protein (molecular 
weight=13.5 kDa) . The same protein was also purified 
using HSA-affinity chromatography, employing the ABD 
25 moiety of the fusion protein (Figure 3, lane 2) . In 
both cases, a product of high purity was obtained 
showing that both moieties in the fusion protein are 
functional and bind their cognate binding partner with 
high selectivity when genetically linked as a fusion 
3 0 construct . 

Example 3. Construction of a divalent protein A-specific 
hybrid affinity module. 

A genetic construct was assembled encoding a fusion 
35 protein Z 8G -ABD-Z 8G , containing a serum albumin binding 

domain (ABD) , flanked both N-and C- terminally by the 8G 
affibody domain. Thus, this protein construct 



- 24 - 



corresponds to a divalent protein A-binding protein. 
The construct was assembled using splice overlap 
extension (SOE) PCR from two different PCR products 
(Figure 4) sharing common sequence corresponding to the 
5 intended junction. The first PCR fragment (Fragment A, 
figure 4) was produced by PCR amplification using 
primers SNAP-1 and SNAP-2 on a template consisting of 
phagemid vector pKNl-Z 8G resulting in a PCR product 
encoding the Z 8G and ABD domains. Similarly, the second 

10 PCR product (fragment B, figure 4) was also prepared by 
PCR using the pKNl-Z 8G phagemid as template but with the 
use. of primers SNAP- 3 and SNAP-4 resulting in a PCR 
product encoding the Z 8G affibody domain. The two 
fragments were subsequently joined through overlapping 

15 sequences in a SOE-PCR reaction, using approximately 5 
ng each of the two PCR products. For the amplification 
of the resulting joined fragment, 5 mole of outer PCR 
primers SNAP-1 and SNAP-4 were added to the PCR reaction 
tube before temperature cycling. The polymerase used 

20 for this procedure was Taq DNA polymerase (Perkin Elmer 
Corp.). The temperature cycling scheme for this 
procedure was: 96°C (30 sec), 40°C (30 sec) and 72°C (45 
sec) for ten cycles followed by 25 cycles of 96°C (15 
sec) , 55°C (30 sec) and 72°C (45 sec) . 

25 The resulting PCR product was ligated into a pGEM 

cloning vector (AT-cloning, Clontech) and the ligation 
mixture transformed to RR1AM15 cells. Resulting 
transf ormants (ampicillin containing agar plates) were 
PCR-screened using primers RIT 27 and RIT 28 (Hultman et 

30 al., Nucl. Acids Res. 17:4937-4946, 1989). and sequenced 
using primers USP and RSP (Amersham Pharmacia Biotech) . 
A clone with the correct sequence of the insert was 
identified and purified plasmid DNA from this clone was 
cleaved with restriction enzymes Pst I and Eco RI . The 

3 5 purified gene fragment encoding the divalent affinity 
module Z 8G -ABD-Z 8G was subsequently ligated into the 
expression vector pE318 (Jendeberg et al . , J. Immunol. 
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Methods, 201:25-34, 1997) cleaved with the same enzymes 
and the ligation mixture transformed to E. coli cells. 
One clone (no. 18) with a correct size of the insert as 
judged from agarose gel electrophoresis after cleavage 
5 of purified plasmid DNA was subjected to confirmatory 
DNA sequencing and was found to be correct. This clone 
was subsequently used for production of protein Z 8G -ABD- 
Z 8G protein which was purified by either or protein AG or 
HSA-affinity chromatography (Nygren et al . , 1988) from a 

10 periplasmic extract of an overnight culture of E . coli 
cells harboring the pE-Z 8G -ABD-Z 8G construct. The 
successful purification of the Z 8G -ABD-Z 8G fusion protein 
by the two principles HSA or protein AG-affinity 
chromatography showed that the ABD domain flanked at 

15 both the N- terminal and C- terminal by Z 8G domains as well 
as the Z 8G domains were functional in the fusion protein. 
Analysis by SDS-PAGE showed that the majority of the 
protein were of full length with a strong band of the 
apparent molecular weight of 19.7 kDa (Figure 5) . 

20 

Example 4. Analysis by native gel electrophoresis of 
self -assembled protein complexes. 

To investigate if the two affinity modules Z 8G -ABD- 
Z 8G and protein A were capable of forming protein 

2 5 complexes of higher order, the two proteins were mixed 

at different molar ratios and subsequently analyzed by 
native polyacrylamide gel electrophoresis (PAGE) . In 
concordance with a productive complex formation between 
the two affinity modules, a band corresponding to a 

3 0 larger form showing slower migration appeared in samples 

in which the two proteins had been mixed at different 
stoichiometrical ratios (Figure 6 , lanes 3 -5) . The 
appearance of this band indicated that a bimolecular 
complex between the Z 8G -ABD-Z 8G and the protein A affinity 
35 modules had been formed, since the intensity of the 
smaller molecular weight band of Z 8G -ABD-Z 8G decreased 
with the increasing intensity of the larger band. At 
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even higher relative concentrations of protein A, bands 
corresponding to even slower migrations appeared, 
suggesting that at these ratios, higher order networks 
(more than two affinity modules) had been spontaneously 
5 formed from biospecific interactions which were stable 
enough to hold together even when analyzed by the gel 
electrophoresis procedure. 

Example 5. Analysis by atomic force microscopy of self- 
10 assembled protein complexes. 

Analyses by atomic force microscopy (AFM) were 
performed to investigate if complexes formed between the 
two affinity modules Z 8G -ABD-Z 8G and protein A could be 
visualized. Mica surfaces were prepared and cleaned 
15 with alcohol and used to apply samples of proteins. Two 
different samples were analyzed in a Nanoscope Ilia AFM 
instrument (Digital Instruments) which was used to scan 
the surfaces in air. Instrumental settings were: Z- 
height=2 nm; scan rate=l . 5 Hz; tapping mode). In a first 

2 0 experiment, 2 0 /xl of a pure 2 0 nM Protein A solution was 

applied onto the mica surface and was allowed to dry in 
air. Analysis of this sample by AFM revealed spatially 
distributed spots with an apparent mean diameter of 
approximately 2 0 nm, indicating that protein A molecules 
25 were evenly distributed on the mica surface (Figure 7, 
left panel) . In contrast, when a second sample 
consisting of a mixture between protein A protein (10 
nM) and the divalent protein A-binding Z 8G -ABD-Z 8G 
affinity module (20 nM) was analyzed with the same 

3 0 instrumental settings, larger protein forms were clearly 

visible (Figure 7, right panel) . This indicates that 
the addition of the Z 8G -ABD-Z 8G affinity module capable of 
selective biospecific interaction with the five-domain 
protein A affinity module resulted in the spontaneous 
35 formation of multi protein complexes of higher order 
structures from building blocks held together by 
biospecific interactions . 



